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The  Final  Report  for  Award  Number  DAMD17-97-1-7070 

The  Proposal  Title:  The  Use  of  a  New  Technique  to  Study  DNA  Methylation  in  Breast 
Cancer 

I.  Introduction 

The  overall  goal  of  the  proposal  funded  by  the  Department  of  Defense  Breast  Cancer 
Research  Program,  was  to  study  DNA  methylation  in  human  breast  cancer  development, 
and  use  differentially  methylated  genomic  DNA  fragments  (DMGFs)  to  search  for  breast 
cancer  related  genes.  To  reach  this  objective,  our  specific  aims  were: 

1.  to  develop  and  refine  a  new  technique,  Methyl  Differential  Display  (MDD),  to  study 
the  role(s)  of  DNA  methylation  in  human  breast  cancer  development. 

2.  to  apply  this  technique  to  isolate  genomic  markers  which  detect  altered  DNA 
methylation  patterns  in  breast  cancer  cells. 

3.  to  search  for  new  types  of  oncogene(s)  whose  expressions  were  under  the  control  of 
DNA  methylation  mechanisms  by  DMGFs. 

4.  to  determine  the  biological  function(s)  of  candidate  gene(s). 

5.  to  search  for  the  candidate  gene's  potential  use  in  clinical  diagnosis  and  prognosis. 

In  the  past  three  years,  our  studies  progress  has  included  the  following  achievements: 
1).  To  develop  the  MDD  technique;  2).  To  isolate  DMGFs  by  employing  MDD;  3).  To 
discover  a  novel  gene,  TSP50 ,  by  a  hypomethylated  DMGF,  BR50.  4).  To  analyze  the  TSP50 
gene's  biological  characteristics  and  its  breast  cancer  related  features.  Our  research  found 
that  a.  The  TSP50  gene  was  specifically  expressed  in  human  testes;  b.  It  could  encode  a  new 
serine  protease;  c).  Its  expression  could  be  regulated  by  DNA  methylation;  d).  It  was 
abnormally  activated  in  some  breast  cancer  patients;  and  e).  Its  transcripts  were  located  in 
the  cytoplasm  of  the  neoplastic  epithelia  cells  in  breast  tissue. 

In  the  following  paragraphs,  a  detailed  report  of  our  achievements  will  be  presented. 
The  overall  findings  related  to  the  TSP50  gene  have  been  published  in  Cancer  Research ,  and 
presented  during  the  American  Association  for  Cancer  Research  (AACR)  annual  meeting 

for  the  year  2000  (see  Appendix). 

* 

II.  Body 

This  section  includes  two  parts,  the  experimental  procedures  which  were  employed 
for  this  research  and  the  major  results  obtained. 

II.A.  Experimental  Procedures 

II. A. 1.  DNA  from  human  cancer  biopsies.  Dissected  human  breast  and  ovarian  cancer 
tissues  (tumor  and  matched  normal)  were  immediately  frozen  in  liquid  nitrogen,  and 
stored  at  -70  °C.  DNAs  were  isolated  from  those  tissues  by  the  phenol  extraction  method 

(1). 
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II.A.2.  MDD.  1-2  pg  of  DNA  (tester)  isolated  from  human  breast  cancer  biopsies  and  their 
matched  normal  DNA  (driver)  were  cleaved  with  Msp  I  (20  U/(ll)  (Boehringer  Mannheim), 
and  Mse  I  (20  U/pl)  (New  England  Biolabs,  Inc.)  in  a  50  pi  reaction  for  3  hours.  To  prepare 
tester  and  driver  master  amplicons,  the  Msp  I  and  Mse  I  digested  tester  and  driver  genomic 
DNAs  were  ligated  to  1.5  pg  MSA24-mer  and  0.75  pg  MSA12-mer  (2);  these  were  the  first 
pair  of  oligonucleotide  linkers  which  only  recognize  the  ends  generated  by  Msp  I.  The 
procedures  for  amplicon  preparation  were  as  described  (2,3).  The  DNA  amplicons  were 
then  purified  by  phenol,  phenol/chloroform  extraction.  To  remove  the  first  set  of  linkers 
from  the  driver  amplicon,  80  pg  of  driver  amplicon  DNA  was  digested  with  the  Msp  I 
enzyme  (10  U/pl).  To  change  the  tester  master  amplicon  DNA  linkers,  5  pg  of  tester  master 
amplicon  was  digested  with  Msp  I  (20  U/pl)  and  ligated  to  0.6  pg  of  MSB24-mer,  and  0.3  pg 
of  MSB12-mer,  these  were  the  second  set  of  oligonucleotide  linkers.  Subtractive 
hybridization  was  performed  as  described  (2).  The  first  round  of  difference  products  (DPI) 
were  amplified  as  described  (2).  To  prepare  the  second  round  of  subtractive  hybridization,  3 
pg  of  DPI  was  digested  with  the  restriction  endonuclease  Msp  I  (20  U/pl).  To  put  a  new  set 
of  linkers  on  DPI,  O.lpg  of  DPI  was  mixed  with  0.6  pg  of  MSC24-mer,  and  0.3  pg  of  MSC12- 
mer.  Another  round  of  subtractive  hybridization/PCR  amplification  was  repeated. 
Difference  product  2  (DP2)  usually  contained  several  individual  DNA  fragments  when 
electrophoresed  on  a  2%  agarose  gel.  The  individual  DNA  fragments  were  purified  by 
DNA  gel  extraction  kit  (Qiagen  Inc.),  and  subcloned  into  pUC118  vector,  which  was 

linearized  by  the  restriction  endonuclease  Acc  I,  and  transformed  into  E.  Coli  (DH5a). 
Twelve  cloned  inserts  were  chosen  to  be  amplified,  from  which  different  sized  probes  were 
selected  for  master  amplicon  southern  blotting,  and  human  genomic  DNA  southern 
blotting. 

II.A.3.  Amplicon  DNA  southern  blot.  The  first  round  of  positive  probe  screening  was 
performed  with  amplicon  DNA  southern  blots.  Non-Radiation  Southern  Blot  and 
Detection  Kits  (Genius™)  were  purchased  from  Boehringer  Mannheim.  Probe  labeling  and 
detection  followed  the  instructions  of  the  manufacturer.  2-3  pg  of  tester  and  driver 
amplicon  DNA  were  electrophoresed  on  a  2%  agarose  gel,  and  blotted  to  positively  charged 
nylon  membranes  (Boehringer  Mannheim).  For  prehybridization,  the  membranes  were 
placed  at  68  °C  for  2-4  hours  in  solutions  containing  6  X  SSC,  5  X  Denhardt's  solution,  0.5% 
SDS,  0.1  M  EDTA,  and  50  pg/ml  of  salmon  sperm  DNA.  Under  the  same  conditions,  the 
probes  were  added,  and  hybridized  to  the  membranes  overnight.  The  membranes  were 
then  rinsed  three  times  with  2  X  SSC,  1  X  Blot  wash  (12  mM  Na2HP04,  8  mM  NaH2P04,  1.4 
mM  Na4P2C>7,  0.5%  SDS)  at  68  °C,  and  further  washed  three  times  (30  minutes  each)  with 
the  same  buffer  at  68  oC.  Next,  the  membranes  were  equilibrated  in  buffer  A  (100  mM 
Tris.HCl,  150  mM,  pH  7.5)  and  transferred  into  buffer  B  (2%  block  reagent  in  buffer  A)  which 
was  incubated  at  room  temperature  for  one  hour.  The  membranes  were  then  washed  2 
times  for  15  minutes  with  buffer  A,  and  equilibrated  in  buffer  C  (100  mM  Tris.HCl,  100  mM 
NaCl,  10  mM  MgCl2).  Before  the  membranes  were  exposed  on  Kodak  X-OMAT  film  for  one 
hour,  they  were  rinsed  in  lumi-P530  for  1  min  and  kept  in  a  plastic  sheet  protector. 

II.A.4.  Genomic  DNA  southern  blot.  The  positive  probes  confirmed  by  amplicon  DNA 
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southern  blot  experiments  were  tested  further  by  human  genomic  DNA  southern  blotting. 
Genomic  DNAs  isolated  from  tumor  tissues,  and  their  respective  matched  normal  tissues, 
were  digested  with  Msp  I  (20  U/pl),  and  electrophoresed  on  1.5%  agarose  gels,  which  were 
then  transferred  to  Hybond  N-membranes  (Amersham,  Arlington  Heights,  IL).  These 
membranes  were  exposed  to  UV  light  to  immobilize  the  DNA.  The  probes  for  the  southern 
blot  were  labeled  with  High  Prime  DNA  labeling  kits  (Boehringer  Mannhein)  following  the 
instructions  of  the  manufacturer.  The  procedure  for  hybridization  and  blot  wash  were  the 
same  as  in  the  Amplicon  DNA  southern  blotting  section. 

II.A.5.  DNA  sequence  and  chromosome  assignment.  The  pUC118  plasmid  containing  the 
candidate  DP2  fragment  was  sequenced  using  the  Ampli-Cycle  sequencing  kit  (Perkin- 
Elmer),  under  conditions  described  by  the  manufacturer.  Chromosome  assignment  for  the 
candidate  DP2  fragment  was  determined  by  genomic  southern  blotting  of  the  Hind  III 
digested  monochromosomal  human/rodent  somatic  cell  hybrid  mapping  panel  #2  (NIGMS 
Human  Genetic  Mutant  Cell  Repository)  while  it  was  used  as  a  probe.  Fine  chromosome 
mapping  was  performed  with  Genebridge  4  Radiation  Hybrid  Panel  (Research  Genetics,  Inc.) 
by  PCR  amplification^). 

II.A.6.  Human  genomic  DNA  library  screening.  A  human  placenta  genomic  phage  library, 
EMBL3  SP6/7  (Clontech,  Inc.),  was  used  for  cloning  a  longer  genomic  fragment  containing 
the  candidate  fragment.  Phage  infection  procedure  was  based  on  the  instructions  supplied 
by  the  manufacturer.  2  X  106  plaques  were  evenly  distributed  on  20  plates  (150  X  15  mm) 
and  then  transferred  onto  Hybond  N-membranes  (Amersham,  Arlington  Heights,  IL).  The 
treatment  of  the  membranes,  preparation  of  the  probe,  and  the  blot  wash  were  the  same  as 
that  described  in  the  Genomic  DNA  southern  blot  section.  The  phage  DNA,  with  human 
DNA  insert,  was  purified  by  the  Lambda  TRAP  Plus  Kit  (Clontech,  Inc.)  following  the 
instructions  of  the  manufacturer.  The  individual  insert  was  released  by  restriction  enzyme 
Sst  I  cleavage  from  the  phage  DNA  arms,  then  subcloned  into  pUC118  plasmid. 

II.A.7.  Northern  analysis.  Two  Human  Multiple  Tissue  Northern  Blot  panels,  MTN™, 
and  MTN™  II,  were  purchased  from  Clontech  Inc.  The  MTNTM  blot  contains 
approximately  2  [ig  of  polyA+  RNA  per  lane  from  eight  different  human  tissues  (heart, 
brain,  placenta,  lung,  liver,  skeletal  muscle,  kidney,  and  pancreas).  The  MTN™  II  blot 
contains  the  same  amounts  of  mRNA  from  an  additional  eight  different  human  tissues 
(spleen,  thymus,  prostate,  testes,  ovary,  small  intestine,  colon,  and  peripheral  blood 
leukocyte).  The  probe  labeling  and  detection  was  the  same  as  above. 

II.A.8.  Human  complimentary  DNA  (cDNA)  library  screening.  A  human  testes  Xgtll 
cDNA  library  (Human  Testis  cDNA  library,  Clontech.  Inc.)  was  used  to  obtain  an  intact  gene 
following  the  instructions  of  the  manufacturer.  2  X  105  plaques  evenly  distributed  on  6 
plates  (150  X  15  mm)  were  transferred  onto  Hybond  N-membranes.  Southern  analysis  was 

performed  as  before.  The  phage  DNA,  with  human  cDNA  insert,  was  purified  by  the  X 
Quick!  Spin  Kit  (BIO  101,  Inc.)  following  the  instructions  of  the  manufacturer.  The 
individual  insert  was  released  by  restriction  enzyme  Eco  RI  cleavage  from  the  phage  DNA 
arms,  then  subcloned  into  pUC118  plasmid. 
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II. A. 9.  Reverse  Transcription-PCR  (RT-PCR).  Total  RNAs  were  isolated  from  paired  breast 
cancer  and  normal  tissues  by  RNA  isolation  kit,  RNA  STAT-60  (TEL-TEST,  Inc.).  The  first 
strand  cDNA  was  synthesized  by  Superscript  Preamplification  system  kit  (GIBCOBRL,  Life 
Technologies).  Oligomers  E  (ACCAGAGCGTCCAGTGTGTCC,  sense)  and  F 
(TGGGACTTGATGATCTGAACC,  antisense)  were  used  to  synthesize  the  TSP50  gene.  The 
predicted  size  was  699  bp.  fi-actin  was  used  as  an  internal  control  whose  sense  and  antisense 
primers  were:  5'-G  ACG  ACAT GGAGA  AGATCTGG-3  '  and  5'- 

TGTAGAGGTAGTCAGTCAGG-3'.  The  predicted  size  for  fi-actin  was  335  bp.  The  PCR 
reaction  mixture  consisted  of  cDNA  derived  from  125  ng  of  RNA,  10  pmole  of  sense  and 
antisense  primers  from  both  TSP50  and  fi-actin,  200  |iM  of  four  deoxynucleotide 
triphosphate,  and  0.125  unit  of  Taq  DNA  polymerase  with  reaction  buffer  (Perkin  Elmer)  in 
a  final  volume  of  25  |il.  Thirty  eight  cycles  of  PCR  were  carried  out.  Each  cycle  of  PCR 
included  30  seconds  of  denaturation  at  95  oC,  60  seconds  of  annealing  at  60  oC,  and  60 
seconds  of  extension  at  72  oC.  The  PCR  products  were  separated  on  a  2%  agarose  gel. 

II.A.10.  In  situ  hybridization.  A  700  bp  length  of  the  3'  end  anti-sense  TSP50  RNA,  or  its 
sense  version  which  served  as  a  negative  control,  was  labeled  with  digoxigenin  (Dig)  and 
hybridized  to  properly  treated  breast  cancer  tissue  sections  embedded  in  paraffin.  After 
incubation  with  anti-Dig-antibody  conjugated  biotin,  the  expression  of  TSP50  in  each  tissue 
section  was  detected  by  streptavidin  alkaline  phosphatase  and  biotin  complex  (brown  color). 
Hematoxylin  was  used  for  counterstaining  (background  stain,  blue  color).  The  colored 
signals  were  visible  by  light  microscopy  and  the  results  were  examined  by  two  pathologists. 


II. B.  Results 

III. B.l.  Isolation  of  hypomethylated  sequences  from  human  breast  cancer  biopsies.  The 

DNAs  isolated  from  five  paired  human  breast  cancer  biopsies  (tester)  and  their  surrounding 
normal  tissues  (driver)  were  cleaved  with  the  Msp  I  and  Mse  I  enzymes.  The  tester  and 
driver  amplicons  with  both  ends  cleaved  by  Msp  I  were  selectively  prepared  by  PCR 
amplification  (see  Materials  and  Methods).  After  two  rounds  of  DNA 
hybridization/subtraction  and  PCR  amplification,  individual  fragments  (DP2)  were  isolated 
from  two  breast  cancer  patients  (see  Materials  and  Methods).  The  DP2  fragments  were 
subcloned  into  the  pUC118  vector,  and  the  inserts  were  amplified  by  PCR.  12  different  sized 
inserts  were  selected  from  each  MDD  and  used  as  probes  for  the  master  amplicon  southern 
blotting.  A  total  of  four  probes,  BR50,  BR97,  BR104,  and  BR254  were  identified  as  candidates 
(Table  1). 

Genomic  Southern  blot  suggested  that  all  four  fragments  were  hypomethylated  in  the 
original  patient,  and  also  in  some  other  breast  and  ovarian  cancer  patients.  However,  since 
BR50  was  located  in  the  chromosome  3pl2-14  region,  whose  abnormality  was  common  in 
many  different  types  of  cancers,  we  decided  to  focus  our  attention  on  studying  this  fragment. 

II.B. 2.  Search  for  the  coding  regions  by  probe  BR50.  Since  BR50  is  a  genomic  fragment  the 
chance  that  it  encodes  a  polypeptide  are  slim  because  a  majority  of  human  genomic  DNA 
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sequences  are  noncoding  sequences.  As  a  result,  we  decided  that  our  first  step  in  searching 
for  a  gene  should  be  to  isolate  a  longer  DNA  piece  from  a  human  genomic  DNA  library, 
with  the  hope  that  it  may  contain  exon(s).  It  is  an  accepted  fact  that  DNA  methylation  sites 
can  be  near  genes  (5),  and  with  this  in  mind  we  decided  to  screen  a  human  placenta 
genomic  phage  library,  EMBL3  SP6/7  (Clontech,  Inc.),  where  the  insertion  sizes  were 
relatively  small  (9  to  28  kbp).  For  this  purpose,  2  X  106  plaques  derived  from  the  library  were 
screened  by  probe  BR50.  As  a  result,  a  17  kbp  length  clone  was  isolated.  To  obtain  sequence 
information,  the  DNA  clone  was  released  from  the  phage  DNA  arms  by  restriction  enzyme 
Sst  I  cleavage  which  generated  eight  DNA  fragments.  All  eight  fragments  were  subcloned 
into  pUC118  plasmids.  The  fragments  smaller  than  1  kbp  were  completely  sequenced,  while 
the  fragments  larger  than  1  kbp  were  partially  sequenced  from  both  the  plasmid  and  insert 
junctions.  A  homolog  search  of  the  NIH  GeneBank  revealed  two  exons,  BR50-44  and  BR50- 
45,  which  contained  112  and  132  nuecleotides,  respectively.  Both  exons  encoded 
polypeptides  which  were  about  50%  identifiable  to  several  mammalian  proteases,  such  as 
serine  proteases  and  tryptases.  The  BR50-45  sequence  was  found  142  bps  up  stream  of  the 
BR50  sequence. 

Since  we  did  not  know  the  exact  positions  of  the  two  exons  in  the  17  kbp  fragment,  it 
was  possible  that  other  exon(s)  might  lay  between  them.  To  gain  a  longer  coding  sequence, 
we  designed  four  oligomers  (A,  B,  C,  D)  based  on  both  exon's  sequence  information  to 
perform  PCR.  Oligomer  A  (5'-CCTGGATGGTCAGCGTG-3')  and  B  (5'- 
CTGGGAGGCAATGATGGT-3'),  which  were  on  the  complimentary  strand,  were  based  on 
the  sequence  information  of  BR-44;  and  C  (5'-CTGGAGAGCCCTTGGTCT-3  )  and  D  (5  - 
CAGTGTTGGTAGGAGGAG-3'),  which  were  on  the  complimentary  strand,  were  based  on 
the  sequence  information  of  BR-45.  A  strategy  using  four  different  combinations  of 
oligomer  pairs  was  employed  to  perform  PCR  by  utilizing  the  Human  Universal  cDNA 
Library  Panel  (Clontech  Inc.).  A  PCR  product  which  was  about  700  bps  in  length  was 
generated  from  one  oligomer  combination  (A/D,  5'-CCTGGATGGTCAGCGTG-3'/5'- 
CAGTGTTGGTAGGAGGAG-3').  This  PCR  product  was  directly  sequenced.  Combining  the 
sequence  information  of  the  PCR  product  and  the  two  exons,  we  obtained  a  cDNA  fragment 
which  contained  974  bps.  A  DNA  homolog  search  of  the  NIH  GeneBank  revealed  again, 
that  it  coded  for  a  protease  like  protein,  and  the  overall  identity  was  approximately  40%. 

II.B.3.  The  candidate  gene  is  highly,  and  specifically  expressed  in  human  testes.  To  obtain  a 
full  length  cDNA,  it  is  critical  to  use  the  right  cDNA  library  where  the  gene  of  interest  is 
expressed.  Thus,  two  Human  Multiple  Tissue  Northern  Blot  panels,  MTN^M  and  M^TN^m 
II,  containing  16  different  tissue  mRNAs  (Clontech  Inc.),  were  used  to  test  the  expression  of 
the  candidate  gene  by  using  the  700  bp  cDNA  PCR  product  as  a  probe.  Our  finding  revealed 
that  there  were  no  visible  transcripts  of  this  gene  in  the  eight  mRNAs  (heart,  brain, 
placenta,  lung,  liver,  skeletal  muscle,  kidney,  and  pancreas)  included  in  the  MTN™  panel 
(data  not  shown).  However,  in  the  MTN™  II  panel,  a  1.7  kb  band  was  heavily  hybridized  by 
the  probe  exclusively  in  the  testes  mRNA  as  compared  to  the  control  probe,  which  was  the 
human  Yub6  gene  (Fig.  1).  These  results  suggested  that  the  gene  that  we  were  searching  for 
was  a  tissue  specific  gene.  We  have  named  the  gene  TSP50  (Testes  Specific  Protease).  At 
this  moment,  the  gene's  biological  function(s)  in  human  testes  remain  unknown. 
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II.B.4.  Isolation  of  the  full  length  TSP50  gene  from  a  human  testes  cDNA  library.  Since 
TSP50  is  highly  expressed  in  human  testes,  to  search  for  the  full  length  TSP50  gene,  a 
human  testes  cDNA  library  (Human  Testis  cDNA  library,  Clontech.  Inc.)  was  screened  by 
the  700  bp  TSP50  cDNA  sequence.  A  cDNA  clone  containing  the  probe  sequence  was 
isolated.  Sequence  analysis  indicated  that  this  fragment  encoded  a  protein  with  385  animo 
acids  (Fig.  2.).  There  is  a  stop  codon  located  at  the  117th  bp  up  stream  of  the  initial 
translation  site,  and  there  is  a  125  bp  untranslated  region  before  a  polyadenosine  sequence. 
These  results  implied  that  a  full  length  gene  thus  had  been  obtained.  It  was  also  notable 
that  the  BR50  probe  sequence  is  located  at  the  3'  end  of  the  gene,  and  is  only  17  bps  down 
stream  from  the  polyadenosine  adding  signaling  site.  The  exons,  BR50-44  and  BR50-45, 
encode  animo  acids  from  103  to  157  and  308  to  385,  respectively.  The  3'-  untranslated  region 
before  the  polyadenosine  site  was  also  included  in  the  sequence  of  BR50-45. 

II. B. 5.  DNA  methylation  status  of  the  TSP50  gene  in  human  testes  and  other  normal 
tissues.  Our  studies  have  proven  that  TSP50  is  a  tissue  specific  gene,  and  the  methylation 
patterns  in  its  3'-  region  were  altered  in  some  breast  and  ovarian  cancers.  It  is  also  common 
knowledge  that  many  tissue  specific  genes  are  methylated,  and  this  methylation  may 
regulate  their  expression  (6-8).  To  explore  the  possible  relationship  between  TSP50  gene 
expression  and  DNA  methylation  in  different  normal  human  tissues,  southern  analysis 
was  performed.  The  normal  tissues  tested  included  the  testes,  where  TSP50  was  expressed, 
and  bladder,  blood,  breast,  colon,  lung,  kidney,  placenta,  and  ovary  samples,  where  TSP50 
was  apparently  not  expressed.  To  perform  the  southern  analysis,  BR50  was  used  as  a  probe. 
DNAs  isolated  from  the  nine  tissues  were  digested  by  Msp  I,  and  Hpa  II,  which  is  an 
isoschizomer  of  the  Msp  I  enzyme  and  the  most  popular  enzyme  used  to  study  DNA 
methylation  patterns  (9).  Hpa  II  digestion  showed  that  in  the  testes  DNA,  two  bands, 
probably  released  from  each  allele  by  enzyme  cleavage,  were  hybridized  by  the  probe. 
However,  in  the  other  tissues'  DNAs,  the  corresponding  bands  were  either  not  hybridized, 
or  hybridized  to  a  much  smaller  degree  (Fig.  3a).  For  Msp  I  cleavage,  both  bands  were 
released  in  different  tissues  to  various  extents  (Fig.  3b).  Both  blots  utilized  a  genomic 
fragment  which  did  not  detect  differential  DNA  methylation  as  a  control  to  determine 
complete  enzymatic  digestion  (Fig.  3).  These  results  demonstrated  that  the  TSP50  gene  was 
differentially  methylated  in  various  human  tissues.  In  general,  DNA  demethylation  in  the 
testes  is  correlated  with  high  levels  of  gene  expression.  Conversely,  DNA  methylation  is 
correlated  with  gene  silencing  in  the  bladder,  blood,  breast,  colon,  lung,  kidney,  placenta, 
and  ovary  tissues. 

II.B.6.  Comparison  of  the  TSP50  product  sequence  with  other  serine  proteases.  Sequence 
analysis  revealed  that  the  TSP50  gene  encodes  a  protein  which  shares  approximately  40% 
identity  with  mammalian  serine  proteases.  Figure  4  compares  the  TSP50  animo  acid 
sequence  with  7  other  serine  proteases  including  prostasin  (10),  plasma  kallikrein  (11), 
coagulation  factor  XI  (12),  fi-tryptase  (13),  hepsin  (14),  plasminogen  (15),  and  acrosin  (16). 
Proteolytic  enzymes  dependent  on  a  serine  residue  for  catalytic  activity  are  widespread  and 
very  numerous.  Serine  proteases  are  found  in  viruses,  bacteria,  and  eukaryotes,  and  they 
include  exopeptidases,  endopeptidases,  oligopeptidases,  and  omega  peptidases.  Over  20 
families  of  serine  peptidases  are  recognized  (17,18),  and  grouped  into  clans  that  may  have 
common  ancestors.  The  peptidases  of  chymotrypsin,  subtilisin,  and  carboxypeptidase  C 


clans  have  in  common  a  "catalytic  triad"  of  three  amino  acids:  Serine  (Ser,  nucleophile). 
Aspartate  (Asp,  electrophile),  and  Histidine  (His,  base).  However,  there  are  some  serine 
peptidases  that  have  distinctive  mechanisms  of  action  without  the  classic  Ser,  His,  Asp 
triad.  The  multiple  sequence  alignment  for  tsp50p  showed  that  it  contains  triad  Hisi53  and 
Asp206.  However,  the  Ser  at  position  310  has  been  replaced  by  threonine  (Thr)  (Fig.  4).  The 
corresponding  nucleotides  for  coding  Thr  in  the  TSP50  gene  were  ACT,  while  one  base  pair 
switch,  such  as  C  to  G,  will  result  in  a  Ser  codon,  AGT.  As  a  result,  one  may  wonder 
whether  this  change  was  caused  by  a  point  mutation  happening  in  the  cells,  or  an  error  in 
DNA  sequencing.  Based  on  our  experience,  these  assumption  are  unlikely  since  the  DNA 
fragment  containing  this  codon  was  isolated  from  two  individual  libraries,  the  human 
placenta  genomic  phage  library  and  testes  cDNA  library,  and  the  ACT  codon  was  verified  by 
DNA  sequencing  in  both  fragments.  This  implies  that  the  Ser  was  replaced  by  Thr  in  the 
predicted  Ser  triad  of  tsp50p.  However,  Thr  and  Ser  residues  are  structurally  similar  (Thr 
has  an  extra  methyl  group  as  compared  to  Ser).  Both  Thr  and  Ser  contain  the  HO  group  that 
is  critical  for  enzymatic  catalysis.  In  addition,  the  Thr  residue  in  tsp50p  was  surrounded 
with  conserved  residues  including  a  crucial  glycine  (Gly)  (Fig.  4).  Usually  the  linear  order  of 
catalytic-site  residues,  clusters  of  conserved  amino  acids  around  the  catalytic  residues,  are 
important  factors  to  classify  a  protease  (17,18).  Therefore,  tsp50p  could  be  a  new  type  of 
serine  protease,  possibly  with  a  distinctive  mechanism  of  action. 

II.B.7.  TSP50  was  differentially  expressed  in  some  breast  cancer  tissues.  Preliminary  results 
demonstrated  that  TSP50  was  differentially  methylated  in  40%  of  the  breast  cancer  tissues 
tested.  This  suggested  that  it  could  also  be  differentially  expressed  in  breast  cancer.  To  test 
this  possibility,  RT-PCR  was  carried  out  to  determine  TSP50  expression  levels  in  eighteen 
paired  breast  cancer  tissues.  Our  findings  showed  that  TSP50  PCR  products  were  generated 
in  five  tumor  tissues,  while  in  their  normal  controls,  they  were  not  visible  relative  to  the 
control  gene,  fi-cictin  (Fig.  5).  Products  generated  from  the  five  patients  were  gel  purified 
and  sequenced.  DNA  sequence  analysis  confirmed  that  the  PCR  products  synthesized  were 
the  TSP50  gene  (data  not  shown).  Therefore,  among  the  eighteen  paired  samples  tested, 
28%  of  the  tissues  expressed  the  TSP50  gene.  At  this  moment,  we  can  not  answer  the 
question  of  whether  activation  of  the  TSP50  gene  in  cancer  is  a  consequence,  or  a  causal 
factor  of  neoplastic  growth.  To  find  the  truth,  it  will  be  necessary  to  perform  in  vitro 
cellular  transformation  and  in  vivo  tumor  induction  assays. 

II.B.8.  In  situ  hybridization  confirmed  that  TSP50  was  expressed  in  breast  cancer  cells. 
Although  RT-PCR  detected  TSP50  gene  activation  in  some  breast  cancer  biopsies,  this 
experiment  did  not  clarify  whether  this  gene  was  expressed  in  the  cancer  or  stroma  cells.  To 
find  the  answer,  In  Situ  Hybridization  (ISH)  assay  was  performed,  where  an  anti-sense  probe 
was  used  to  detect  the  TSP50  gene  expression  in  breast  cancer  tissue  sections.  At  this  point, 
three  breast  cancer  and  one  benign  tumor  have  been  tested.  The  results  found  that  the  anti- 
sense  probe  detected  TSP50  transcripts  (brown  color)  in  cancer  cells  of  an  advanced  cancer 
specimen.  Some  of  the  cancer  cells  (epithelia  cells)  were  stained  heavily  (darker  brown) 
(Fig.  10a)  in  comparison  to  other  cancer  cells  (lighter  brown)  (Fig.  10b).  The  reaction  product 
in  the  adjacent  extracellular  matrix  (Fig.  10a  and  b)  is  likely  to  be  due  to  the  binding  of 
brown  oxidized  diaminobenzide,  which  has  diffused  from  the  original  intracellular  site  of 
reaction.  In  the  negative  controls,  the  anti-sense  and  sense  probe  did  not  stain  normal  or 


cancer  breast  epithelia  cells  (Fig.  IOC  and  Fig.  10D).  Therefore,  they  only  exhibit 
counterstaining  (blue).  The  other  two  breast  cancer  and  benign  samples  were  not  stained  by 
the  anti-sense  probes.  (Data  not  shown).  These  results  demonstrated  that  the  TSP50  gene 
was  activated  in  some  breast  cancer  cells,  which  indicated  that  this  gene  could  be  involved 
in  neoplastic  evolution,  and  perhaps  metastatic  progression. 

III.  Key  Research  Accomplishments 

The  key  research  accomplishments  of  our  study  are: 

1) .  The  establishment  of  a  new  technique,  MDD; 

2) .  The  successful  generation  of  DMGFs  by  MDD,  which  will  be  useful  tools  for 

new  breast  cancer  related  genes; 

3) .  The  acquisition  of  a  novel  gene,  TSP50; 

4) .  The  discovery  of  abnormal  activation  of  TSP50  in  breast  cancer; 

5) .  The  determination  of  abnormal  activation  of  TSP50  in  epithelia  breast  cancer 

cells. 

IV.  Reportable  Outcomes 

1) .  The  studies  on  obtaining  and  characterizing  the  TSP50  gene  have  been 

published  in  the  Cancer  Research  journal  (see  appendix). 

2) .  An  abstract  was  presented  about  TSP50  at  the  2000  AACR  meeting. 

3) .  A  pending  patent  has  been  filed  for  the  TSP50  gene. 

4) .  Established  cell  lines  that  integrated  the  TSP50  gene  into  their  genomes. 

5) .  NIH  funding  was  applied  for  based  on  the  TSP50  gene  study. 

6) .  The  postdoctoral  fellow  who  worked  on  the  proposal  was  trained  by  this 

award. 

VI.  Conclusions 

The  important  and  unique  feature  of  the  new  Methyl-Differential  Display  technique 
is  to  selectively  analyze  CG  rich,  differentially  methylated  sequences,  which  are  usually  close 
to  coding  regions,  by  Msp  I  and  partner  enzyme  cleavage.  To  date  four  DMGFs,  which 
detected  DNA  hypomethylation,  were  isolated  from  breast  cancer  patients.  Among  them 
three  also  detected  DNA  hypomethylation  in  ovarian  cancer  samples  (Table  1).  All  the 
DMGFs  had  high  GC  contents,  and  one  DMGF,  BR50,  was  successfully  used  as  a  probe  to 
discover  the  coding  region  which  was  only  140  bps  away.  These  regions'  sequencing 
information  led  to  the  discovery  of  a  974  bp  gene  fragment  from  a  human  cDNA  library 
panel  by  PCR  amplification.  To  obtain  a  full  length  gene,  a  northern  evaluation  of  sixteen 
different  types  of  human  RNAs  was  performed.  The  results  demonstrated  that  the  target 
gene  was  specifically  expressed  in  human  testes  tissue.  This  information  secured  the 
isolation  of  an  intact  gene,  TSP50,  by  screening  a  human  testes  cDNA  library.  The  sequence 
analysis  revealed  that  the  TSP50  gene  most  likely  encodes  a  serine  protease. 

It  is  well  known  that  proteases  of  all  major  classes  (i.e.,  serine,  aspartic,  cysteine, 


threonine,  and  metalloproteinases)  are  linked  with  various  malignancies,  especially  those 
exhibiting  the  metastasis  phenotype.  For  example,  the  prostate  specific  antigen  (PSA)  is  a 
kallikrein-like  serine  protease  that  is  utilized  as  a  clinical  marker  for  the  diagnosis  and 
staging  of  prostate  cancer  where  its  preferential  expression  in  prostate  epithelial  cells  is 
increased  (15-20).  In  addition,  the  matrix  metalloproteinases  (MMPs)  have  been  repeatedly 
implicated  in  metastasis  (21-23).  Since  the  TSP50  gene  product  (tsp50p)  could  be  a  serine 
protease,  and  was  activated  in  breast  cancer  cells,  it  is  logical  to  proceed  to  the  next  step  and 
ask  whether  this  gene  could  play  a  role  in  breast  cancer  invasiveness? 

It  is  common  knowledge  that  many  tissue  specific  genes'  expression  is  regulated  by 
DNA  methylation  which  usually  modifies  the  promoter,  or  sometimes  the  3'-  regions  (2-4, 
24).  Our  preliminary  results,  although  only  obtained  from  analyzing  the  DNA  methylation 
status  of  the  3'  flanking  region  of  the  gene,  have  proven  that  TSP50  is  one  of  those  tissue 
specific  genes.  It  will  be  interesting  to  discover  whether  the  gene's  promoter  region  is  also 
methylated  when  the  corresponding  sequence  information  is  available.  The  Hpa  II  and  Msp 
I  methylation  sensitive  southern  analysis  of  the  TSP50  gene's  3'-  region  demonstrated  that, 
in  Hpa  II  digested  DNAs,  probe  BR50  hybridized  two  bands  in  the  testes  tissue,  but  none  in 
the  other  samples.  The  lower  band,  which  was  the  same  size  as  the  probe,  represented  the 
unmethylated  DNA  pattern,  while  the  upper  band  obviously  contained  the  internal  Hpa  II 
recognition  site(s)  which  remained  methylated.  In  Msp  I  digested  DNAs,  the  upper  band 
was  dominant  in  most  tissues,  while  in  the  testes,  the  lower  band  was  dominant.  These 
results  suggest  that  the  GGCCGG  end  of  BR50  was  methylated  in  other  tissues,  but  not  in  the 
testes.  The  DNA  methylation  patterns  observed  in  both  blots  are  probably  allelic  oriented. 
It  seems  that  DNA  hypomethylation  was  accompanied  by  the  gene's  expression  in  the  testes, 
and  conversely,  DNA  hypermethylation  was  accompanied  by  the  gene's  silencing  in  other 
tissues.  The  correlation  between  DNA  methylation  and  gene  expression  provided 
additional  proof  that  DNA  methylation  could  be  an  important  mechanism  in  governing 
the  genes'  expression  in  various  differentiated  human  cells. 

The  differential  expression  of  the  TSP50  gene  has  been  tested  in  eighteen  paired  breast 
cancer  biopsies.  Our  findings  have  shown  that  this  gene  was  activated  in  five  cancer 
samples.  This  finding  indicates  that  this  novel  gene's  expression  is  related  to  breast  cancer 
progression.  In  addition,  in  situ  hybridization  has  demonstrated  that  the  gene  was  activated 
in  the  epithelia  cancer  cells.  We  believe  that  the  cellular  location  of  the  PSP 50  gene  product 
will  soon  be  uncovered  since  the  antibody  for  TSP50  is  now  available.  By  performing  the 
immunohistochemistry  technique,  the  statistic  data  for  activation  of  this  new  gene  in  breast 
cancer,  as  well  as  other  cancers  will  be  obtained.  This  information  will  help  us  to 
understand  whether  the  TSP50  gene  can  be  considered  as  a  bio-marker  for  diagnosis  or 
prognosis  of  human  breast  cancer. 


VII.  References 

1.  Sambrook,  J.,  Fritsch,  E.  F.  and  Manitis,  T.  in  Molecular  Cloning  Manual,  2nd  ed. 
(ed.  Ford,  N.  Nolan,  C.  and  Ferguson,  M.)  9.16-9.19  (Cold  Spring  Harbor  Laboratory 
Press,  Cold  Spring  Harbor,  New  York,  1989). 


1  3 

2.  Yuan,  L.  M.,  Shan,  J.  D.,  De  Risi,  D.,  Broome,  J.,  Lovecchio,  J.,  Gal,  D.,  Vinciguerra,  V., 
and  Xu,  H.  P.,  1999.  Isolation  of  a  novel  gene,  TSP50,  by  a  hypomethylated  DNA 
fragment  in  human  breast  cancer.  Cancer  Res.,  59,  3215-3221. 

3.  Lisitsyn,  N.,  Lisitsyn,  N.  and  Wigler,  M.  Cloning  the  differences  between  two  complex 
genomes.  Science  259:  946-951,  1993. 

4.  Walter,  M.  A.,  Spillett,  D.  J.,  Thomas,  P.,  Weissenbach,  J.,  and  Goodfellow,  P.  N.,  A 
method  for  constructing  radiation  hybrid  maps  of  whole  genomes.  Nature  Genet.  7: 
2-28, 1994. 

5.  Larsen,  F.,  Gunderson,  G.,  Lopez,  R.,  and  Prydz,  H.  CpG  islands  as  gene  markers  in 
the  human  genome.  Genomics,  13:  1095-1107,  1992. 

6.  Bird,  A.  Gene  number,  noise  reduction  and  biological  complexity.  Trends  Genet., 

11:  94-100, 1995. 

7.  Kass,  S.  U.,  Pruss,  D.,  and  Wolffe,  A.  P.  How  does  DNA  methylation  repress 
transcription?  Trends  Genet.,  13:  444-449, 1997. 

8.  Siegfried,  Z.,  and  Cedar,  H.  DNA  methylation:  a  molecular  lock.  Curr.  Biol.,  7: 
R305-R307,  1997. 

9.  Nelson,  M.,  and  McClelland,  M.  Site-specific  methylation:  effect  on  DNA 
methylation  methyltransferases  and  restriction  endonucleases.  Nucl.  Acids  Res., 
19:  2045-2071, 1991. 

10.  Yu,  J.  X.,  Chao,  L.,  and  Chao,  J.,  1995.  Molecular  cloning,  tissue-specific  expression, 
and  cellular  location  of  human  prostasin  mRNA.  J.  Biol.  Chem.  270,  13483-13489. 

11.  Chung,  D.  W.,  Fujikawa,  K.,  McMullen,  B.  A.,  and  Davie,  E.  W.,  1986.  Human  plasma 
prekallikrein,  a  zymogen  to  a  serine  protease  that  contains  four  tandem  repeats. 
Biochemistry  25,  2410-2417. 

12.  Fujikawa,  K.,  Chung,  D.  W.,  Hendrickson,  L.  E.,  and  Davie,  E.  W.,  1986.  Amino  acid 
sequence  of  human  factor  XI,  a  blood  coagulation  factor  with  four  tandem  repeats  that 
are  highly  homologous  with  plasma  prekallikrein.  Biochemistry  255,  2417-2424. 

13.  Vanderslice,  P.,  Ballinger,  S.  M.,  Tam,  E.  K.,  Goldstein,  S.  M.,  Craik,  C.  S.,  and 
Caughey,  G.  H.,  1990. Human  mast  cell  tryptase:  multiple  cDNAs  and  genes  reveal  a 
multigene  serine  protease  family.  Proc.  Natl.  Acad.  Sci.  U.  S.  A.  87,  3811-3815. 

14.  Leytus,,  S.  P.,  Loeb,  K.  R.,  Hagen,  F.  S.,  Kurachi,  K.,  and  Davie,  E.  W.,  1988.  A  novel 
trypsin-like  serine  protease  (hepsin)  with  a  putative  transmembrane  domain 
expressed  by  human  liver  and  hepatoma  cells.  Biochemistry  27,  1067-1074. 

15.  Forsgren,  M.,  Raden,  B.,  Israelsson,  M.,  Larsson,  K.,  and  Heden,  L.-O.,  1987.  Molecular 
cloning  and  characterization  of  a  full-length  cDNA  clone  for  human  plasminogen. 
FEBS  Lett.  213,  254-260. 

16.  Adham,  I.  M.,  Klemm,  U.,  Maier,  W.-M.,  and  Engel,  W.,  1990.  Molecular  cloning  of 
human  preproacrosin  cDNA.  Hum.  Genet.  84.  125-128. 

17.  Barrett,  A.  J.,  1994.  Classification  of  peptidases.  Methods  in  Enzymol.  244, 1-14. 

18.  Rawlings,  N.  D.,  and  Barrett,  A.  J.,  1994.  Families  of  serine  peptidases.  Meth.  Enzymol. 
244, 19-31. 

19.  Wang,  M.  C.,  Valenzuela,  L.A.,  Murphy,  G.  P.,  and  Chu,  T.  M.,  1979.  Purification  of  a 
human  prostate  specific  antigen.  Invest.  Urol.  17,  159-163. 

20.  Yoshihito,  B.,  Wang,  M.  C.,  Watt,  K.  W.  K.,  Loor,  R.,  and  Chu,  T.  M.,  1984.  The 
proteolytic  activity  of  human  prostate-specific  antigen.  Bioch.  &  Biophy.  Res. 
Commun.  123,  482-488. 


21.  Chambers,  A.  F.,  and  Matrisian,  L.  M.,  1997.  Changing  views  of  the  role  of  matrix 
metalloproteinases  in  metastasis.  J.  Natl  Cancer  Inst.  89,  1260-1270. 

22.  Powell  W.  C.,  and  Martrisian,  L.  M.,  1996.  Complex  roles  of  matrix  metalloproteinases 
in  tumor  progression.  In:  Gunthert  U.  Birchmeier  W.  Editors.  Attempts  to 
understand  metastasis  formation  I:  metastasis-related  molecules.  Berlin:  Spring- 
Verlag,  pl-21. 

23.  Birkedal-Hansen,  H.,  Moore,  W.  G.,  Bodden,  M.  K.,  Windsor,  L.  J.,  Birkedal-Hansen, 
B.,  DeCarlo,  A.,  et  al,  1993.  Matrix  metalloproteinases:  a  review.  Crit.  Rev.  Oral  Biol. 
Med.  4, 197-250. 

24.  Graff,  J.  R.,  Herman,  J.  G.,  Myohanen,  S.,  Baylin,  S.  B.,  and  Vertino,  P.  M.  Mapping 
patterns  of  CpG  island  methylation  in  normal  and  neoplastic  cells  implicates  both 
upstream  and  downstream  regions  in  de  novo  methylation.  J.  Biol.  Chem.,  272, 
22322-22329, 1997. 


VIII.  Table  and  Figures 

Table  1.  The  Characters  of  the  DMGFs  isolated  by  MDD 


NAME 

CHROM 

ABNORMALITY 

GENOMIC/SEQ 

(bps) 

cDNA/SEQ 

(bps) 

PROTEIN 

BR50 

3pl2-14 

HypoM  in  BR/OV’ 

►  1005 

244 

Proteinase 

BR97 

12q24 

HypoM  in  BR/OV 

693 

BR104 

19 

HypoM  in  BR 

307 

BR254 

8qll-12 

HypoM  in  BR/OV 
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Breast  (BR)  and  Ovarian  (OV)  cancer. 


mRNA 
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TSP50 
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control 

probe 


Fig.  1.  Northern  blot  results  analyzed 
by  a  fragment  of  the  TSP50  gene.  The 
Human  Multiple  Tissue  Northern 
Blot  panel,  MTNTM  n,  containing  8 
different  tissues'  mRNA  (Clontech  Inc.) 
was  tested  to  determine  the  expression 
levels  of  the  gene.  Compared  to  the 
control,  the  human  Rab6  gene,  which 
was  evenly  expressed  in  all  tissues, 
TSP50  was  highly  expressed  in  the  testes 
tissue,  but  not  in  the  others.  No  single 
tissue  (total  eight)  in  another  Human 
Multiple  Tissue  Northern  Blot  panel 
was  hybridized  with  the  probe  (data  not 
shown). 


-59  gtcgtgggggcggc 

actgggagcgccttccggagagacgcagtcggctgccaccccggg 

1  atgggtcgctggtgccagaccgtcgcgcgcgggcagcgcccccgg 
MGRWCQTVARGQRPR 
4  6  acgtctgccccctcccgcgccggtgccctgctgctgctgcttctg 
TSAPSRAGALLLLLL 
91  ttgctgaggtctgcaggttgctggggcgcaggggaagccccgggg 
LLRSAGCWGAGEAPG 
136  gcgctgtccactgctgatcccgccgaccagagcgtccagtgtgtc 
ALSTADPADQSVQCV 
181  cccaaggccacctgtccttccagccggcctcgccttctctggcag 
PKATCPSSRPRLLWQ 
226  accccgaccacccagacactgccctcgaccaccatggagacccaa 
TPTTQTLPSTTMETQ 
271  ttcccagtttctgaaggcaaagtcgacccataccgctcctgtggc 

fpvsegkvdpyrscg 

316  ttttcctacgagcaggaccccaccctcagggacccagaagccgtg 

fsyeqdptlrdpeav 

361  gctcggcggtggccctggatggtcagcgtgcgggccaatggcaca 

arrwpwmvsvrangt 

406  cacatctgtgccggcaccatcattgcctcccagtgggtgctgact 

HICAGTIIASQWVLT 


451  gtggcccactgcctgatctggcgtgatgttatctactcagtgagg 
VAHCLIWRDVIYSVR 
496  gtggggagtccgtggattgaccagatgacgcagaccgcctccgat 
VGSPWIDQMTQTASD 
541  gtcccggtgctccaggtcatcatgcatagcaggtaccgggcccag 
VPVLQVIMHSRYRAQ 
586  cggttctggtcctgggtgggccaggccaacgacatcggcctcctc 
RFWSWVGQANDIGLL 
631  aagctcaagcaggaactcaagtacagcaattacgtgcggcccatc 
KLKQELKYSNYVRPI 
676  tgcctgcctggcacggactatgtgttgaaggaccattcccgctgc 
CLPGTDYVLKDHSRC 
721  actgtgacgggctggggactttccaaggctgacggcatgtggcct 
TVTGWGLSKADGMWP 
766  cagttccggaccattcaggagaaggaagtcatcatcctgaacaac 
QFRTIQEKEVI  ILNN 
811  aaagagtgtgacaatttctaccacaacttcaccaaaatccccact 
KECDNFYHNFTKIPT 
856  ctggttcagatcatcaagtcccagatgatgtgtgcggaggacacc 
LVQI  IKSQMMCAEDT 
901  cacagggagaagttctgctatgagctaactggagagcccttggtc 
HREKFC'Y  ELTGEPLV 
946  tgctccatggagggcacgtggtacctggtgggattggtgagctgg 
C  S  M  E  G  T  W  Y  L  V  G  L  V  S  W 
991  ggtgcaggctgccagaagagcgaggccccacccatctqcctacag 
GAGCQKSEAPPIYLQ 
1036  gtctcctcctaccaacactggatctgggactgcctcaacgggcag 
VSSYQHWIWDCLNGQ 
1081  gccctggccctgccagccccatccaggaccctgctcctggcactc 
ALALPAPSRTLLLAL 
1126  ccactgcccctcagcctccttgctgccctctgactctgtgtgccc 
PLPLSLLAAL  * 

1171  tccctcacttg 


Fig.  2.  The  nuclear  acid  and  amino  acid  sequence  of  TSP50. 
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Fig.  4.  DNA  methylation  status  of  the 
TSP50  gene  in  nine  normal  human 
tissues  examined  by  southern  blot.  In  a 
and  b,  the  results  obtained  from  Hpa  II 
and  Msp  I  digestion,  respectively.  6  |ig 
of  DNA  isolated  from  each  tissue  were 
cleaved  by  the  enzymes  and  subjected  to 
southern  analysis  by  probe  BR50.  a.  The 
results  show  that  bands  which  are 
approximately  1  kbp  and  2  kbp  in  length 
were  released  by  Hpa  II  only  in  the 
testes  tissue,  b.  A  2  kbp  band  was 
released  by  Msp  I  in  most  tissues.  In  a 
and  b.  The  control  probe  hybridizes  a 
single  band  in  each  tissue's  DNA,  which 
provides  proof  of  complete  enzymatic 
digestion. 
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Fig.  5.  The  results  of  RT-PCR  for 
differential  expression  of  TSP50 
in  five  out  of  eighteen  breast 
cancer  and  normal  control  tissues 
tested.  Lane  MW  represents  Hae 
III  ol74  markers  in  base  pairs. 
The  fl-actin  and  TSP50  lanes 
served  as  positive  controls,  and 
were  generated  by  using  cDNA 
prepared  from  testes  tissue  RNA. 
Lane  ft-actin  +  TSP50  contains 
simultaneously  generated  TSP50 
and  fi-actin  from  testes  cDNA. 
The  number  for  each  patient 
tested  is  listed  above  the  bracket. 
T  and  N  represent  tumor  and 
normal  tissues.  The  result  shows 
that  the  TSP50  gene  was 


abnormally  activated  in  approximately  30%  of  breast  cancer  patients. 


1  9 


Fig.  6.  The  results  of  In  Situ  Hybridization  of  an  advanced  cancer  specimen.  In  a,  the  anti- 
sense  probe  detected  TSP50  transcripts  stained  with  a  darker  brown  color  pointed  out  by 
arrows.  In  b,  the  anti-sense  probe  detected  TSP50  transcripts  stained  with  a  lighter  brown 
color  pointed  out  by  arrows.  In  c.  The  anti-sense  probe  did  not  stain  normal  breast  tissue 
(negative  control).  In  d.  the  sense  probe  did  not  cause  any  brown  color  staining  (negative 
control),  the  blue  staining  represents  the  counterstaining  by  hematoxylin.  Magnification 
150  X. 
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ABSTRACT 

A  novel  gene,  testes-specific  protease  50  (TSP50),  was  isolated  from  a 
human  testes  cDNA  library  by  using  a  genomic  DNA  probe,  BR50.  BR50 
was  isolated  by  a  modified  representational  difference  analysis  (RDA) 
technique  due  to  its  hypomethylated  feature  in  a  breast  cancer  biopsy. 
This  altered  DNA  methylation  status  was  also  detected  by  BR50  in  other 
breast  and  some  ovarian  cancer  tissues.  The  TSP50  gene  product  is  a 
homologue  to  several  human  proteases,  which  indicates  that  it  may  encode 
a  protease-like  protein.  Northern  analysis  of  16  different  types  of  normal 
human  tissues  suggests  that  TSP50  was  highly  and  specifically  expressed 
in  human  testes,  which  indicates  that  it  might  possess  a  unique  biological 
function(s)  in  that  organ.  Methylation  status  analysis  in  normal  human 
testes  and  other  tissues  showed  a  correlation  between  DNA  methylation 
and  gene  expression.  Most  importantly,  reverse  transcription-PCR  anal¬ 
ysis  of  18  paired  breast  cancer  tissues  found  that  in  28%  of  the  cancer 
samples,  the  TSP50  gene  was  differentially  expressed.  The  possibility  that 
TSP50  may  be  an  oncogene  is  presently  under  investigation. 

INTRODUCTION 

Abnormal  DNA  methylations  (hypomethylation  or  hypermethyla- 
tion)  have  been  linked  to  various  human  diseases  including  cancers 
(1-8),  Because  methylated  DNA  sites  are  usually  close  to  genes 
(9-13),  searching  for  differentially  methylated  DNA  fragments  in 
cancer  could  pinpoint  genes  of  interest.  Consequently,  a  modified 
RDA  technique  using  human  breast  cancer  biopsies  as  starting  mate¬ 
rial  was  used  to  search  for  differentially  methylated  DNA  fragments. 
Unlike  traditional  RDA3  (14),  to  perform  modified  RDA,  the  restric¬ 
tion  enzyme  Mspl ,  which  is  sensitive  to  the  methylated  GC-rich 
sequence  GGCmCGG  (15-18),  was  used  as  a  master  enzyme  to  cleave 
genomic  DNAs  for  amplicon  preparation.  Mspl  is  a  relatively  frequent 
cutting  enzyme  (19),  which,  when  used  alone,  produces  an  amplicon 
of  high  complexity  that  can  cause  unsuccessful  subtractive  hybridiza¬ 
tion.  Hence,  a  second  restriction  enzyme,  or  “partner  enzyme,”  has 
been  incorporated  into  the  technique.  The  amplicons  can  only  be  made 
by  PCR  from  DNA  fragments  with  both  ends  cut  by  the  Mspl  enzyme. 

As  a  result  of  using  this  modified  technique,  two  DNA  fragments, 
BR50  and  BR254,  were  isolated  that  detected  DNA  hypomethylation 
in  breast  cancer.  Additional  studies  verified  that  both  fragments  also 
detected  hypomethylation  in  ovarian  cancer,  and  BR254  was  ampli¬ 
fied  in  1  of  10  breast  cancer  biopsies.  On  the  basis  of  these  findings, 
we  considered  both  fragments  good  candidates  to  search  for  genes  that 
might  be  related  to  various  malignancies.  This  report  will  focus  on 
presenting  the  detailed  studies  related  to  BR50 ,  which  covers  its  own 
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isolation  as  a  differentially  methylated  DNA  fragment,  to  its  utiliza¬ 
tion  in  the  discovery  of  a  novel  gene,  TSP50. 

Investigation  of  the  TSP50  gene  has  found  that  it  encodes  a  pro- 
tease-like  protein.  Northern  analysis  of  multiple  human  tissue  RNA 
expression  panels  showed  that  TSP50  is  a  tissue-specific  gene,  which 
was  heavily  expressed  in  human  testes.  There  were  almost  no  visible 
amounts  of  TSP50  transcript  displayed  in  the  other  15  types  of  human 
tissues  in  the  panels.  This  result  indicates  that  the  TSP 50  gene  holds 
a  special  physiological  function(s)  in  human  testes.  The  DNA  meth¬ 
ylation  status  of  the  downstream  region  of  the  gene  in  normal  human 
testes  and  eight  other  tissues  was  also  examined.  Apparently,  DNA 
methylation  silences  the  TSP50  gene  expression  in  those  eight  normal 
tissues,  whereas  DNA  demethylation  in  human  testes  could  be  a  key 
element  responsible  for  gene  expression.  Furthermore,  RT-PCR  was 
performed  to  examine  differential  expression  in  breast  cancer  and 
matched  normal  control  tissues.  We  found  that  —28%  of  the  cancer 
samples  tested  expressed  the  TSP50  gene,  whereas  the  corresponding 
controls  did  not.  Whether  there  is  a  relationship  between  gene  expres¬ 
sion  and  cancer  development  is  presently  under  investigation. 

MATERIALS  AND  METHODS 

DNA  from  Human  Cancer  Biopsies.  Dissected  human  breast  and  ovarian 
cancer  tissues  (tumor  and  matched  normal)  were  immediately  frozen  in  liquid 
nitrogen  and  stored  at  -70°C.  DNAs  were  isolated  from  those  tissues  by  the 
phenol  extraction  method  (20). 

Modified  RDA.  One  to  two  /xg  of  DNA  (tester)  isolated  from  human  breast 
cancer  biopsies  and  their  matched  normal  DNA  (driver)  were  cleaved  with 
Mspl  (20  units//xl;  Boehringer  Mannheim)  and  Mse I  (20  units//xl;  New  Eng¬ 
land  Biolabs,  Inc.)  in  a  50-/xl  reaction  for  3  h.  To  prepare  tester  and  driver 
master  amplicons,  the  Mspl-  and  Mvel-digested  tester  and  driver  genomic 
DNAs  were  ligated  to  1.5  /xg  of  MSA24-mer  and  0.75  /xg  of  MSA12-mer 
(Table  1);  these  were  the  first  pair  of  oligonucleotide  linkers  that  only  recog¬ 
nize  the  ends  generated  by  Mspl.  The  procedures  for  amplicon  preparation 
were  performed  as  described  (14).  The  DNA  amplicons  were  then  purified  by 
phenol,  phenol/chloroform  extraction.  To  remove  the  first  set  of  linkers  from 
the  driver  amplicon,  80  /xg  of  driver  amplicon  DNA  were  digested  with  the 
Mspl  enzyme  (10  units//xl).  To  change  the  tester  master  amplicon  DNA 
linkers,  5  /xg  of  tester  master  amplicon  were  digested  with  Mspl  (20  units//xl) 
and  ligated  to  0.6  /xg  of  MSB24-mer  and  0.3  /xg  of  MSB12-mer  (Table  1); 
these  were  the  second  set  of  oligonucleotide  linkers.  Subtractive  hybridization 
was  performed  as  described  (14).  The  first  round  of  difference  products  (DPI) 
were  amplified  as  described  (14).  To  prepare  the  second  round  of  subtractive 
hybridization,  3  /xg  of  DPI  were  digested  with  the  restriction  endonuclease 
Mspl  (20  units//xl).  To  put  a  new  set  of  linkers  on  DPI,  0.1  /xg  of  DPI  was 
mixed  with  0.6  /xg  of  MSC24-mer  and  0.3  /xg  of  MSC12-mer  (Table  1). 
Another  round  of  subtractive  hybridization/PCR  amplification  was  repeated. 
The  second  round  of  difference  products  (DP2)  usually  contained  several 
individual  DNA  fragments  when  electrophoresed  on  a  2%  agarose  gel.  The 
individual  DNA  fragments  were  purified  by  DNA  gel  extraction  kit  (Qiagen, 
Inc.)  and  subcloned  into  pUCl  18  vector,  which  was  linearized  by  the  restric¬ 
tion  endonuclease  Accl  and  transformed  into  Escherichia  coli  ( DH5a ).  Twelve 
cloned  inserts  were  chosen  to  be  amplified,  from  which  different- si  zed  probes 
were  selected  for  master  amplicon  Southern  blot.  The  candidate  probes  were 
then  further  tested  by  human  genomic  DNA  Southern  blot. 

Amplicon  DNA  Southern  Blot.  The  first  round  of  positive  probe  screen¬ 
ing  was  performed  with  amplicon  DNA  Southern  blots.  Non-Radiation  South- 
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Table  1  Sequence  of  the  oligonucleotides  for  PCR  amplification 


Primer 

Sequence 

MSA24 

5 '  - CTCGTCGTC AGGTCAGTGCTTC AC - 3 ' 

MSA12 

5 '  -CGGTGAAGCACT- 3 ' 

MSB24 

5 '  -TAGAGCCACGTAGCTGCTGTAGTC - 3 ' 

MSB12 

5 '  -CGGACTACAGCA- 3 ' 

MSC24 

5 '  - AC  C  GTGG AC TGGAT AGGTTC AG AC - 3 ' 

MSC12 

5'-CGGTCTGAACCT-3' 

em  Blot  and  Detection  kits  (Genius)  were  purchased  from  Boehringer  Mann¬ 
heim.  Probe  labeling  and  detection  followed  the  instructions  of  the 
manufacturer.  Two  to  three  /big  of  tester  and  driver  amplicon  DNA  were 
electrophoresed  on  a  2%  agarose  gel  and  blotted  to  positively  charged  nylon 
membranes  (Boehringer  Mannheim).  For  prehybridization,  the  membranes 
were  placed  at  68°C  for  2—4  h  in  solutions  containing  6X  SSC,  5X  Denhardt  s 
solution,  0.5%  SDS,  0.1  M  EDTA,  and  50  /xg/ml  of  salmon  sperm  DNA.  Under 
the  same  conditions,  the  probes  were  added  and  hybridized  to  the  membranes 
overnight.  The  membranes  were  then  rinsed  three  times  with  2X  SSC,  1 X  Blot 
wash  (12  mM  Na2HP04,  8  mM  NaH2P04,  1.4  mM  Na4P207,  and  0.5%  SDS)  at 
68°C  and  further  washed  three  times  (30  min  each)  with  the  same  buffer  at 
68°C.  Next,  the  membranes  were  equilibrated  in  buffer  A  (100  mM  Tris*HCl, 
150  mM,  pH  7.5)  and  transferred  into  buffer  B  (2%  block  reagent  in  buffer  A), 
which  was  incubated  at  room  temperature  for  1  h.  The  membranes  were  then 
washed  2  times  for  15  min  with  buffer  A  and  equilibrated  in  buffer  C  (100  mM 
Tris'HCl,  100  mM  NaCl,  and  10  mM  MgCl2).  Before  the  membranes  were 
exposed  on  Kodak  X-OMAT  film  for  1  h,  they  were  rinsed  in  lumi-P530  for 

I  min  and  kept  in  a  plastic  sheet  protector. 

Genomic  DNA  Southern  Blot.  Genomic  DNAs  were  digested  with  a 
desired  restriction  enzyme  (20  units/jul)  and  electrophoresed  on  1.5%  agarose 
gels,  which  were  then  transferred  to  Hybond  N"  membranes  (Amersham). 
These  membranes  were  exposed  to  UV  light  to  immobilize  the  DNA.  Probes 
for  the  Southern  blot  were  labeled  with  High  Prime  DNA  labeling  kits 
(Boehringer  Mannheim)  following  the  instructions  of  the  manufacturer.  The 
procedure  for  hybridization  and  blot  wash  were  the  same  as  in  the  Amplicon 
DNA  Southern  blot  section. 

Northern  Analysis.  Two  Human  Multiple  Tissue  Northern  blot  panels, 
MTN  and  MTN II,  were  purchased  from  Clontech,  Inc.  The  MTN  blot  contains 
~2  pig  of  poly(A)+  RNA  per  lane  from  eight  different  human  tissues  (heart, 
brain,  placenta,  lung,  liver,  skeletal  muscle,  kidney,  and  pancreas).  The  MTN 

II  blot  contains  the  same  amounts  of  mRNA  from  an  additional  eight  different 
human  tissues  (spleen,  thymus,  prostate,  testes,  ovary,  small  intestine,  colon, 
and  peripheral  blood  leukocyte).  The  labeling  and  detection  of  the  probes  were 
the  same  as  above. 

DNA  Sequence  and  Chromosome  Assignment.  The  pUC118  plasmid 
containing  the  candidate  DNA  fragment  was  sequenced  using  the  Ampli-Cycle 
sequencing  kit  (Perkin-Elmer),  under  conditions  described  by  the  manufac¬ 
turer.  Chromosome  assignment  for  the  candidate  DNA  fragment  was  deter¬ 
mined  by  genomic  Southern  blot  of  the  Hindlll  digested  monochromosomal 
human/rodent  somatic  cell  hybrid  mapping  panel  #2  (NIGMS  Human  Genetic 
Mutant  Cell  Repository)  while  it  was  used  as  a  probe.  Fine  chromosome 
mapping  was  performed  with  GeneBridge  4  Radiation  Hybrid  Panel  (Research 
Genetics,  Inc.)  by  PCR  amplification  (21). 

Human  Genomic  DNA  Library  Screening.  A  human  placenta  genomic 
phage  library,  EMBL3  SP6/T7  (Clontech,  Inc.),  was  used  for  cloning  a  longer 
genomic  fragment  containing  the  candidate  fragment.  Phage  infection  proce¬ 
dure  was  based  on  the  instructions  supplied  by  the  manufacturer.  Plaques 
(2  X  106)  were  evenly  distributed  on  20  plates  (150  X  15  mm),  then  trans¬ 
ferred  onto  Hybond  N“  membranes.  The  treatment  of  the  membranes,  prep¬ 
aration  of  the  probe,  and  the  blot  wash  were  the  same  as  that  described  in  the 
Genomic  DNA  Southern  blot  section.  The  phage  DNA,  with  human  DNA 
insert,  was  purified  by  the  Lambda  TRAP  Plus  kit  (Clontech,  Inc.)  following 
the  instructions  of  the  manufacturer.  The  individual  insert  was  released  by 
restriction  enzyme  Sstl  cleavage  from  the  phage  DNA  arms,  then  subcloned 
into  pUC118  plasmid. 

Human  cDNA  Library  Screening.  A  human  testes  Agtll  cDNA  library 
(Human  Testis  cDNA  Library;  Clontech,  Inc.)  was  used  to  obtain  an  intact 
gene  following  the  instructions  of  the  manufacturer.  Plaques  (2  X  105)  evenly 
distributed  on  six  plates  (150  X  15  mm)  were  transferred  onto  Hybond  N“ 


membranes.  Southern  analysis  was  performed  as  before.  The  phage  DNA,  with 
human  cDNA  insert,  was  purified  by  the  A  Quick!  Spin  kit  (BIO  101,  Inc.) 
following  the  instructions  of  the  manufacturer.  The  individual  insert  was 
released  by  restriction  enzyme  EcoRl  cleavage  from  the  phage  DNA  arms, 
then  subcloned  into  pUC118  plasmid. 

RT-PCR.  Total  RNAs  were  isolated  from  paired  breast  cancer  and  normal 
tissues  by  RNA  isolation  kit,  RNA  STAT-60  (TEL-TEST,  Inc.).  The  first- 
strand  cDNA  was  synthesized  by  Superscript  Preamplification  system  kit  (Life 
Technologies,  Inc.).  Oligomers  E  (ACCAGAGCGTCCAGTGTGTCC,  sense) 
and  F  (TGGGACTTGATGATCTGAACC,  antisense)  were  used  to  synthesize 
the  TSP50  gene.  The  predicted  size  was  699  bp.  fi-actin  was  used  as  an  internal 
control,  the  sense  and  antisense  primers  of  which  were  5 '  -GACGACATG- 
GAGAAGATCTGG-3 '  and  5 ' -TGTAG AGGTAGTC AGTC AGG-3 ' .  The  pre¬ 
dicted  size  for  J8 -actin  was  335  bp.  The  PCR  reaction  mixture  was  comprised 
of  cDNA  derived  from  125  ng  of  RNA,  10  pmol  of  sense  and  antisense  primers 
from  both  TSP50  and  (3-actin,  200  pM  of  four  deoxynucleotide  triphosphate, 
and  0.125  unit  of  Taq  DNA  polymerase  with  reaction  buffer  (Perkin-Elmer)  in 
a  final  volume  of  25  pil.  Thirty-eight  cycles  of  PCR  were  carried  out.  Each 
cycle  of  PCR  included  30  s  of  denaturation  at  95°C,  60  s  of  annealing  at  60°C, 
and  60  s  of  extension  at  72°C.  The  PCR  products  were  separated  on  a  2% 
agarose  gel. 

RESULTS 

Isolation  of  Hypomethylated  Sequences  from  Human  Breast 
Cancer  Biopsies.  The  DNAs  isolated  from  three  paired  human  breast 
cancer  biopsies  (tester)  and  surrounding  normal  tissues  (driver)  were 
cleaved  with  the  Mspl  and  Mse I  enzymes.  The  tester  and  driver 
amplicons  with  both  ends  cleaved  by  Mspl  were  selectively  prepared 
by  PCR  amplification  (see  “Materials  and  Methods”).  After  two 
rounds  of  DNA  hybridization/subtraction  and  PCR  amplification, 
individual  fragments  (DP2)  were  isolated  from  two  breast  cancer 
patients  (see  “Materials  and  Methods”).  The  DP2  fragments  were 
subcloned  into  the  pUC118  vector,  and  the  inserts  were  amplified  by 
PCR.  Twelve  different-sized  inserts  were  selected  from  each  modified 
RDA  and  used  as  probes  for  the  master  amplicon  Southern  blot.  Two 
probes,  BR50  and  BR254}  isolated  from  two  patients,  were  identified 
as  candidate  probes  for  additional  study.  In  this  report,  we  focus  on 
presenting  the  work  that  has  been  done  by  probe  BR50. 

Probe  BR50  was  selected  from  the  DP2  isolated  from  breast  cancer 
patient  no.  14’ s  biopsy  by  the  modified  RDA  technique  (Fig.  la) 
because  it  hybridized  a  band  of  much  greater  intensity  in  the  tester 

a  b 

BR14 

probe  i - 1 

M  A  #50  T  D 


Fig.  1.  a,  the  agarose  gel  electrophoresis  of  the  final  difference  products  isolated  by  a 
modified  RDA  technique  from  the  breast  cancer  biopsy  of  patient  no.  14.  Lane  M,  Haelll 
<£174  DNA  size  markers  in  bp.  Lane  A,  DNA  fragments  from  which  BR50  was  isolated. 
b,  the  amplicon  Southern  Blot  results  for  BR50.  Left  lane,  probe  BR50  hybridizes  to  itself 
as  a  positive  control.  In  Lanes  T  and  D,  probe  BR50  hybridizes  with  2  pg  of  tester  (tumor) 
and  driver  (normal)  master  amplicon  DNA  prepared  from  breast  cancer  patient  no.  14.  The 
tester  master  amplicon  DNA  displays  a  much  heavier  hybridized  band  than  the  driver 
master  amplicon  DNA.  The  picture  below  the  Southern  blot  is  the  tester  and  driver 
amplicon  agarose  gel  electrophoresis  before  transferring  it  onto  the  blot  membrane,  which 
served  as  the  DNA  quantitative  control. 
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amplicon  than  that  in  the  driver  amplicon  (Fig.  1  b).  To  confirm  the 
differences  observed  in  the  tester  and  driver  amplicons,  a  Southern 
analysis  was  performed  on  patient  no.  14’s  tumor  and  matched  normal 
genomic  DNAs.  Six  /xg  of  each  DNA  were  cleaved  with  the  Mspl 
enzyme  and  hybridized  by  probe  BR50 .  The  results  showed  that  in  the 
tumor  DNA,  the  probe  hybridized  a  lower  band,  ~1  kbp  long,  of 
much  greater  intensity  than  an  upper  band,  which  was  ~2  kbp  in 
length.  In  the  normal  DNA,  just  the  opposite  occurred  (Fig.  2). 
Because  the  sizes  of  the  upper  and  lower  bands  in  the  tumor  and 
normal  control  DNAs  were  the  same,  the  only  reasonable  explanation 
causing  uneven  hybridization  intensities  is  DNA  hypomethylation  in 
the  tumor  cells.  To  examine  whether  the  event  also  existed  in  other 
breast  cancer  patients,  paired  tumor  and  normal  DNAs  isolated  from 
additional  breast  cancer  biopsies  were  cleaved  with  Mspl  and  sub¬ 
jected  to  Southern  analysis.  The  results  showed  that  of  10  samples 
tested,  4  had  similar  hybridization  patterns  to  those  of  patient  no.  14 
(Fig.  2).  It  is  notable  that  in  the  normal  DNAs  of  patient  nos.  3,  4,  and 
5,  more  than  one  upper  band  was  evident.  We  believe  this  can  be 
attributed  to  partial  DNA  demethylation,  instead  of  incomplete  enzy¬ 
matic  digestion.  This  is  because  the  Southern  membrane  was  reblotted 
by  a  control  probe,  which  was  a  background  probe  isolated  along  with 
probe  BR50 ,  and  only  a  single  hybridized  band  was  displayed  in  each 
lane  (Fig.  2). 

BR50  Also  Detected  Altered  Methylation  Patterns  in  Ovarian 
Cancer.  To  examine  whether  the  hypomethylation  event  also  oc¬ 
curred  in  human  ovarian  cancer,  paired  DNA  samples  isolated  from 
eight  ovarian  cancer  patients  were  analyzed  by  probe  BR50.  The 
DNAs  were  cleaved  with  Mspl  and  hybridized  with  probe  BR50  in  a 
Southern  blot  experiment.  The  results  demonstrated  that  of  eight 
patients  tested,  four  displayed  similar  hybridization  patterns  to  those 
observed  in  the  breast  cancer  samples.  In  the  tumor  DNAs,  the  lower 
band  was  heavily  hybridized  by  the  probe,  whereas  in  the  normal 
control  DNAs,  a  lower  and  upper  band  were  hybridized.  The  com¬ 
pletion  of  DNA  digestion  was  confirmed  by  the  same  control  probe 
used  before,  which  only  hybridized  a  single  band  in  each  lane  (Fig.  3). 
Thus,  BR50  also  detected  altered  DNA  methylation  in  ovarian  cancer. 
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Fig.  2.  The  breast  cancer  genomic  DNA  Southern  blot  for  probe  BR50.  Left  lane  of 
each  blot,  probe  BR50  hybridizes  to  itself  as  a  positive  control.  BR50  hybridizes  with 
Afopl-digested  6  /xg  of  original  tumor  (7)  and  matched  normal  (AO  genomic  DNAs  isolated 
from  patient  no.  14  and  other  breast  cancer  patients.  The  number  for  each  patient  tested 
is  listed  above  the  bracket.  Similar  hypomethylation  patterns  in  patient  no.  14  and  the 
other  patients  are  observed.  The  control  probe  in  the  lower  section  served  as  an  indicator 
of  complete  enzymatic  digestion. 
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Fig.  3.  The  different  methylation  patterns  displayed  by  probe  BR50  in  ovarian  cancer. 
The  number  for  each  patient  is  listed  above  the  bracket.  Lanes  T  and  N  include  6  /xg  of 
tumor  DNA  and  normal  DNA  digested  with  Mspl,  respectively.  Tumor  and  matched 
normal  DNAs  were  cleaved  with  Mspl  and  probed  by  BR50  in  a  Southern  blot  experiment. 
Methyl-differential  patterns  are  detected  by  BR50.  The  completion  of  the  enzyme  diges¬ 
tion  was  confirmed  by  the  control  probe,  which  only  hybridized  a  single  band  in  each  lane. 

Sequencing  and  Chromosome  Assignment  of  Probe  BR50. 

DNA  sequencing  found  that  BR50  contains  1005  bps,  with  a  CG 
content  of  58%  (Fig.  4 a).  The  DNA  fragment  has  a  GGCCGG 
sequence  on  one  end  and  CCGG  sequence  on  the  other  end.  Because 
Mspl  is  sensitive  to  GGCmCGG  and  not  sensitive  to  CCmGG  se¬ 
quences,  it  is  conceivable  that  the  GGCCGG  sequence  was  methyl¬ 
ated  in  normal  breast  tissue  DNA  while  being  hypomethylated  in 
tumor  DNA.  This  prediction  probably  holds  true  because  all  of  the 
candidate  probes,  including  probe  BR254  and  subsequent  probes 
isolated  from  breast  cancer  biopsies,  have  one  end  terminating  with  a 
GGCCGG  sequence  and  the  other  with  a  CCGG  sequence  (data  not 
shown).  A  homologue  search  of  the  NIH  GenBank  discovered  that 
BR50  is  not  homologous  to  any  existing  sequences.4  The  chromosome 
assignment  determined  by  monochromosomal  human/rodent  somatic 
cell  hybrid  mapping  panel  #2  established  its  location  on  chromosome 
3  (data  not  shown).  The  fine  chromosome  position  of  BR50  was 
analyzed  by  PCR  amplification  using  GeneBridge  4  Radiation  Hybrid 
Panel  as  templates.  The  top  and  bottom  strand  primers  for  BR50  are 
5'-ACCAGATGGAGGCAGTTGAC-3'  and  5 '  -  A  AGTGGGTGCT- 
CTTTCTGTG-3',  respectively.  The  result  obtained  from  radiation 
hybrid  mapping  suggests  that  BR50  is  placed  4.29  cR  below  the 
adjacent  STS,  AFMB362WB9,  which  is  179.84  cR  from  the  top  of  the 
chromosome  3  linkage  group.  This  result  confirmed  that  BR50  is 
proximally  located  on  3pl2-14. 

Search  for  the  Coding  Regions  by  Probe  BR50 .  On  the  basis  of 
our  preliminary  analysis,  BR50  was  an  interesting  probe  to  be  used  for 
an  adjacent  gene  search  because  it  was  differentially  methylated  in 
some  breast  and  ovarian  cancer  samples  tested,  although  the  sample 
size  was  small.  BR50  is  a  genomic  fragment,  and  consequently  the 
chances  that  it  encodes  a  polypeptide  are  slim  because  a  majority  of 
human  genomic  DNA  sequences  are  noncoding  sequences.  As  a 
result,  we  decided  that  the  first  step  in  searching  for  a  gene  should  be 
to  isolate  a  longer  DNA  piece  from  a  human  genomic  DNA  library. 


4  GenBank  accession  numbers:  BR50,  U78781;  TSP50,  AF100707. 
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CCGGAGGCAGGCACAGGACTCGGGAGGGACGCTGCCAGCTCTCTGGGTGCTGAGTTCACAAGGCTG 

CATTCATGATTTTCAATAGACCTGTGATGGTCTGTGCCCAGTGCTGGGGACACAGAAGAGTCAAAC 

CTGGCTCCTGACCTGGACCTGGATCATCACGTGACAGGGAGGAGAGCGATCCAGGCTGATGAGGAA 

AGCGCATGACATGGGGTCTTAGGAGCAGTGAGGGGCAGAGCCATGGCCAAAGGCCCCGCCATGGAA 

GCTGAGGACTCTGGCACCAGATGGAGGCAGTTGACGGACCTCTGCCCTTGGGGTCCAACCCATGGG 

CTTCTCATACATAGGGGTGAAAAAGGCCATTCTATTTATGCAGAATTTTCCCATGTGGCCAGGCAG 

CAGAAGTCCAGAGGGGTAGGGGCCACTCAGGGTCACACAGAACAGCAGTTGCTGAAGACTGGGGAA 

GTCCAGGCCTAGGCTCCACCTGCCCTTCCCCTGACATGGGGCCACCACTAGCCTTTTATGGGCAGG 

CCTGGCTGCTGGTGGTTGGAATAACATCTGACTCCAGTGGGTGTCTGTCACCGTCTCCAGACAGGA 

GACAGAGACAGAGGGTCAAAGTTCACTATGGCTCTTTGGGGCAATGAAATGCTGTGTTCTAGCCTC 

TTGCCAGAAATCAGCCAAAGTCAAGGAAAGCCTGACTCCCACAGTTATCACAGAAAGAGCACCCAC 

TTTCCAGCCCAGACAGCTGCACCCCAGCTGGGTCCTGGCAGCCCCAGCTTCAGCCTGGGCGGTATG 

TTCCAGGCCCCTCGATCATCTGACCCTAATATCACCCCTTCACACCCCCTCCACTTTCTGCGGGAG 

CCACCCCGAACCTTTGAATGGGGGAGATCCTGGAGGCTCTGCAATTTTCAGTGTAAACTGCCTGGA 

GTTCCCCACTTCACCCTCATCTGGTTCACCTGTGGACTCCCAACAGAGCAGGCCCAGGAAACGCGG 

GGCCTCTGAGGCCGG 


-59  gtcgtgggggcggc 

actgggagcgccttccggagagacgcagtcggctgccaccccggg 


Fig.  4.  a ,  the  sequence  of  the  genomic  DNA  probe  BR50.  b,  nucle¬ 
otide  and  predicted  amino  acid  sequences  of  the  human  TSP50  gene.  The 
adenosine  at  the  ATG  (bold-type)  initial  codon  is  considered  the  number 
1  nucleotide.  The  stop  codon,  TGA,  is  also  in  boldface .  The  sequences 
of  exon  BR50-44  and  BR50-45  are  in  italics. 


1  atgggtcgctggtgccagaccgtcgcgcgcgggcagcgcccccgg 
MGRWCQTVARGQRPR 

4 6  acgtctgccccctcccgcgccggtgccctgctgctgctgcttctg 
TSAPSRAGALLLLLL 
91  ttgctgaggtctgcaggttgctggggcgcaggggaagccccgggg 
LLRSAGCWGAGEAPG 
136  gcgctgtccactgctgatcccgccgaccagagcgtccagtgtgtc 

alstadpadqsvqcv 

181  cccaaggccacctgtccttccagccggcctcgccttctctggcag 
PKATCPSSRPRLLWQ 
226  accccgaccacccagacactgccctcgaccaccatggagacccaa 

tpttqtlpsttmetq 

271  ttcccagtttctgaaggcaaagtcgacccataccgct  cctgtggc 

fpvsegkvdpyrscg 

316  ttttcctacgagcaggaccccaccctcagggacccagaagccgtg 

fsyeqdptlrdpeav 
361  gctcggcggt ggccc tgga t ggt ca gcgt gcgggcca atggcaca 
A  R  R  WPWMVSVRANGT 
406  cacatctgtgccggcaccatcattgcctcccagtgggtgctgact 

hicagtiiasqwvlt 
451  gtggcccactgcctga fcctggcgtgatgttatctactcagtgagg 
VAHCLIWRDVIYSVR 
496  gtggggagtccgtggattgaccagatgacgcagaccgcctccgat 
VGSPWIDQMTQTASD 
541  gt cccggtgctccaggtcatcatgcatagcaggtaccgggcccag 
VPVLQVIMHSRYRAQ 
586  cggttctggtcctgggtgggccaggccaacgacatcggcctcctc 
RFWSWVGQANDIGLL 
631  aagctcaagcaggaactcaagtacagcaattacgtgcggcccatc 
KLKQELKYSNYVRPI 
67  6  tgcctgcctggcacggactatgtgttgaaggaccattcccgctgc 
CLPGTDYVLKDHSRC 
721  actgtgacgggctggggactttccaaggctgacggcatgtggcct 
TVTGWGLSKADGMWP 
766  cagttccggaccattcaggagaaggaagtcatcatcctgaacaac 

QFRTIQEKEVI  ILNN 
811  aaagagtgtgacaatttctaccacaacttcaccaaaatccccact 
KECDNFYHNFTKIPT 
856  ctggttcagatcatcaagtcccagatgatgtgtgcggaggacacc 

lvqiiksqmmcaedt 
901  cacaggqagaaqttctgctatgagctaactggagagcccttggtc 
HREKFCY  ELTGEPLV 
946  tgctccatggagggcacgtggtacctggtgggattggtgagctgg 
CSMEGTWYLVGLVSW 
991  ggtgcaggctgccagaagagcgaggccccacccatctacctacag 
GAGCQKSEAPPIYLQ 
1036  gtctcctcctaccaacactggatctgggactgcctcaacgggcag 
V  S  S  YQHWIWDCLNGQ 

1081  gccctggccctgccagccccatccaggaccctgctcctggcactc 

alalpapsrtlllal 

1126  ccactgcccctcagcctccttgctgccctctgactctgtgtgccc 
PLPLSLLAAL  * 

1171  tccctcacttg 
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with  the  hope  that  it  may  contain  exon(s).  It  is  well  known  that  DNA 
methylation  sites  can  be  near  genes  (18);  therefore,  we  decided  to 
screen  a  human  placenta  genomic  phage  library,  EMBL3  SP6/T7 
(Clontech,  Inc.),  where  the  insertion  sizes  were  relatively  small  (9-28 
kbp).  For  this  purpose,  2  X  106  plaques  derived  from  the  library  were 
screened  by  probe  BR50.  As  a  result,  a  17-kbp  length  clone  was 
isolated.  To  obtain  the  sequence  information,  the  DNA  clone  was 
released  from  the  phage  DNA  arms  by  restriction  enzyme  Sstl  cleav¬ 
age,  which  generated  eight  DNA  fragments.  All  eight  fragments  were 
subcloned  into  pUC118  plasmids.  The  fragments  smaller  than  1  kbp 
were  completely  sequenced,  whereas  the  fragments  larger  than  1  kbp 
were  partially  sequenced  from  both  the  plasmid  and  insert  junctions. 
A  homologue  search  of  the  NIH  GenBank  revealed  two  exons, 
BR50-44  and  BR50-45,  which  contained  112  and  132  nucleotides, 
respectively.  Both  exons  encoded  polypeptides  that  were  —50%  iden¬ 
tifiable  to  several  mammalian  proteases,  such  as  serine  proteases  and 
tryptases.  The  BR50-45  sequence  was  found  142  bp  upstream  of  the 
BR50  sequence. 

Because  we  did  not  know  the  exact  positions  of  the  two  exons  in  the 
17-kbp  fragment,  it  was  possible  that  other  exon(s)  might  lay  between 
them.  To  gain  a  longer  coding  sequence,  we  designed  four  oligomers 
(A,  B,  C,  and  D),  based  on  the  sequence  information  of  both  exons,  to 
perform  PCR.  Oligomer  A  (5 ' -CCTGGATGGTC AGCGTG-3 ' )  and  B 
(5 '  -CTGGG  AGGC  A  ATGATGGT-3 ' ),  which  were  on  the  compli¬ 
mentary  strand,  were  based  on  the  sequence  information  of  BR-44 ; 
and  C  (5 '-CTGGAGAGCCCTTGGTCT-3 ')  and  D  (S'-CAGTGTTG- 
GTAGGAGGAG-3'),  which  were  on  the  complimentary  strand,  were 
based  on  the  sequence  information  of  BR-45.  A  strategy  using  four 
different  combinations  of  oligomer  pairs  was  used  to  perform  PCR  by 
using  the  Human  Universal  cDNA  Library  Panel  (Clontech,  Inc.).  A 
PCR  product,  which  was  about  700  bp  in  length,  was  generated  from 
one  oligomer  combination  (A/D,  5' -CCTGGATGGTC AGCGTG-3 7 
5 ' -C  AGTGTTGGTAGG  AGGAG-3 ').  This  PCR  product  was  directly 
sequenced.  Combining  the  sequence  information  of  the  PCR  product 
and  the  two  exons,  we  obtained  a  cDNA  fragment  that  contained  974 
bps.  The  DNA  homologue  search  of  the  NIH  GenBank  revealed  again 
that  it  coded  for  a  protease-like  protein,  and  the  overall  identity  was 
-40%. 

The  Candidate  Gene  Is  Highly  and  Specifically  Expressed  in 
Human  Testes.  To  obtain  a  full-length  cDNA,  it  is  critical  to  use  the 
right  cDNA  library  where  the  gene  of  interest  is  expressed.  Thus,  two 
Human  Multiple  Tissue  Northern  blot  panels,  MTN  and  MTN  II, 
containing  16  different  tissue  mRNAs  (Clontech,  Inc.),  were  used  to 
test  the  expression  of  the  candidate  gene  by  using  the  700-bp  cDNA 
PCR  product  as  a  probe.  The  results  showed  that  there  were  no  visible 
transcripts  of  this  gene  in  the  eight  mRNAs  (heart,  brain,  placenta, 
lung,  liver,  skeletal  muscle,  kidney,  and  pancreas)  included  in  the 
MTN  panel  (data  not  shown).  In  the  MTN  II  panel,  a  1.7-kbp  band 
was  heavily  hybridized  by  the  probe  exclusively  in  the  testes  mRNA 
as  compared  with  the  control  probe,  which  was  the  human  rab6  gene 
(Fig.  5).  These  results  suggested  that  the  gene  that  we  were  searching 
for  is  a  tissue-specific  gene.  We  have  named  the  gene  TSP50.  At  this 
moment,  the  biological  function(s)  of  the  gene  in  human  testes  re¬ 
mains  unknown. 

Isolation  of  the  Full-length  TSP50  Gene  from  a  Human  Testes 
cDNA  Library.  TSP50  is  highly  expressed  in  human  testes;  to  search 
for  the  full-length  TSP50  gene,  a  human  testes  cDNA  library  (Human 
Testis  cDNA  Library;  Clontech.  Inc.)  was  screened  by  the  700-bp 
TSP50  cDNA  sequence.  A  cDNA  clone  containing  the  probe  se¬ 
quence  was  isolated.  Sequence  analysis  suggests  that  this  fragment 
encodes  a  protein  with  385  animo  acids  (Fig.  4b).  There  is  a  stop 
codon  located  at  the  117th  bp  upstream  of  the  first  initial  translation 
site,  and  there  is  a  125-bp  untranslated  region  before  a  polyadenosine 
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Fig.  5.  Northern  blot  results  analyzed  by  the  700-bp  TSP50  fragment.  The  Human 
Multiple  Tissue  Northern  blot  panel,  MTN  II,  containing  the  mRNA  of  eight  different 
tissues  (Clontech  Inc.),  was  tested  to  determine  the  expression  levels  of  the  gene. 
Compared  with  the  control  (the  human  Rab6  gene,  which  was  evenly  expressed  in  all 
tissues),  TSP50  was  highly  expressed  in  the  testes  tissue  but  not  in  the  other  tissues. 


sequence.  These  results  imply  that  a  full-length  gene  has  been  ob¬ 
tained.  It  is  also  notable  that  the  BR50  probe  sequence  is  located  at  the 
3'  end  of  the  gene  and  is  only  17  bp  downstream  from  the  poly  ade¬ 
nosine  adding  signaling  site.  The  exons  BR50-44  and  BR50-45  encode 
animo  acids  from  103  to  157  and  308  to  385,  respectively  (Fig.  4.  b.). 
The  3'-  untranslated  region  before  the  poly  adenosine  site  is  also 
included  in  the  sequence  of  BR50-45. 

DNA  Methylation  Status  of  the  TSP50  Gene  in  Human  Testes 
and  Other  Normal  Tissues.  Our  studies  have  proven  that  TSP50  is 
a  tissue-specific  gene,  and  the  methylation  patterns  in  its  3'  region 
were  altered  in  some  breast  and  ovarian  cancers.  It  is  also  known  that 
many  tissue  specific  genes  are  methylated,  and  this  methylation  may 
regulate  their  expression  (22-24).  To  explore  the  possible  relationship 
between  TSP50  gene  expression  and  DNA  methylation  in  different 
normal  human  tissues,  Southern  analysis  was  performed.  The  normal 
tissues  tested  included  the  testes,  where  TSP50  was  expressed,  and 
bladder,  blood,  breast,  colon,  lung,  kidney,  placenta,  and  ovary  sam¬ 
ples,  where  TSP50  was  apparently  not  expressed.  To  perform  the 
Southern  analysis,  BR50  was  used  as  a  probe.  DNAs  isolated  from  the 
nine  tissues  were  digested  by  Mspl  and  Hpall,  which  is  an  isoschi- 
zomer  of  the  Mspl  enzyme  and  the  most  popular  enzyme  used  to  study 
DNA  methylation  patterns  (25).  Hpall  digestion  showed  that  in  the 
testes  DNA,  two  bands,  probably  released  from  each  allele  by  enzyme 
cleavage,  were  hybridized  by  the  probe.  However,  in  the  DNAs  of 
other  tissues,  the  corresponding  bands  were  either  not  hybridized  or 
hybridized  to  a  much  smaller  degree  (Fig.  6a).  For  Mspl  cleavage, 
both  bands  were  released  in  different  tissues  to  various  extents  (Fig. 
6b).  Both  blots  used  a  genomic  fragment  that  did  not  detect  differen¬ 
tial  DNA  methylation  as  a  control  to  determine  complete  enzymatic 
digestion  (Fig.  6).  These  results  demonstrated  that  the  TSP50  gene 
was  differentially  methylated  in  various  human  tissues.  In  general, 
DNA  demethylation  in  the  testes  is  correlated  with  high  levels  of  gene 
expression.  Conversely,  DNA  methylation  is  correlated  with  gene 
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Fig.  6.  DNA  methylation  status  of  the  TSP50  gene  in  nine  normal  human  tissues 
examined  by  Southern  blot.  In  a  and  b,  the  results  obtained  from  Hpall  and  Mspl 
digestion,  respectively,  are  shown.  Six  p,g  of  DNA  isolated  from  each  tissue  were  cleaved 
by  the  enzymes  and  subjected  to  Southern  analysis  by  probe  BR50.  a,  the  results  show  that 
two  bands,  which  are  approximately  1  and  2  kbp  in  length,  were  released  by  Hpall  only 
in  the  testes  tissue,  b,  a  2-kbp  band  was  released  by  Mspl  in  most  tissues.  In  a  and  b,  the 
control  probe  hybridized  a  single  band  in  the  DNA  of  each  tissue,  which  provides  proof 
of  complete  enzymatic  digestion. 


silencing  in  the  bladder,  blood,  breast,  colon,  lung,  kidney,  placenta, 
and  ovary  tissues. 

TSP50  Was  Differentially  Expressed  in  Some  Breast  Cancer 
Tissues.  Preliminary  results  demonstrated  that  TSP50  was  differen¬ 
tially  methylated  in  40%  of  the  breast  cancer  tissues  tested.  This 
suggested  that  it  could  also  be  differentially  expressed  in  cancer.  To 
test  this  possibility,  RT-PCR  was  carried  out  to  determine  TSP50 
expression  levels  in  18  paired  breast  cancer  tissues.  Our  findings 
showed  that  TSP50  PCR  products  were  generated  in  five  tumor 
tissues,  whereas  in  their  normal  controls,  they  were  not  visible  relative 
to  the  control  gene,  fi-actin  (Fig.  7).  Products  generated  from  the  five 
patients  were  gel  purified  and  sequenced.  DNA  sequence  analysis 
confirmed  that  the  PCR  products  synthesized  were  the  TSP50  gene 
(data  not  shown).  Therefore,  among  the  18  paired  samples  tested,  28% 
of  the  tissues  expressed  the  TSP50  gene.  At  this  moment,  we  cannot 
answer  the  question  of  whether  activation  of  the  TSP50  gene  in  cancer 
is  a  consequence  or  a  causal  factor  of  neoplastic  growth.  To  find  the 
truth,  it  will  be  necessary  to  perform  in  vitro  cellular  transformation 
and  in  vivo  tumor  induction  assays. 


lated.  The  extensive  study  of  one  of  the  two  fragments,  BR50 ,  is  the 
subject  of  this  report. 

It  has  been  reported  that  aberrant  DNA  methylations  occur  con¬ 
stantly  in  human  tumors  (9-12,  24,  26).  DNA  hypomethylations 
could  activate  oncogenes,  whereas  DNA  hypermethylation  could  in¬ 
activate  recessive  oncogenes.  Both  events  could  result  in  neoplastic 
growth  (27-35).  The  correlation  between  aberrant  DNA  methylations 
and  malignancies  suggests  that  differentially  methylated  fragments  in 
tumors  isolated  by  a  modified  RDA  technique  could  be  a  valuable  tool 
in  the  search  for  genes  that  might  be  related  to  cancer  development. 
BR50  was  considered  to  hold  such  value  because  it  not  only  detected 
DNA  hypomethylations  in  the  original  breast  cancer  tissues  from 
which  it  was  isolated  but  also  detected  DNA  hypomethylations  in 
other  breast  and  ovarian  cancer  samples. 

Our  first  step  in  processing  the  gene  search  was  to  screen  a  human 
genomic  phage  library.  A  17-kbp  DNA  fragment  was  isolated,  and 
sequence  analysis  suggested  that  this  fragment  contained  at  least  two 
exons  that  were  homologous  to  mammalian  proteases.  The  sequencing 
information  of  the  exons  led  to  the  discovery  of  a  974-bp  gene 
fragment  from  a  human  cDNA  library  panel  by  PCR  amplification.  To 
obtain  a  full-length  gene,  a  Northern  evaluation  on  16  different  types 
of  human  RNAs  was  performed.  The  results  demonstrated  that  the 
target  gene  was  specifically  expressed  in  human  testes  tissue.  This 
information  secured  the  isolation  of  an  intact  gene,  TSP50 ,  by  screen¬ 
ing  a  human  testes  cDNA  library.  The  sequence  analysis  revealed  that 
the  TSP50  gene  encodes  a  protein  that  shares  ~40%  identity  with 
mammalian  proteases,  such  as  human  tryptase  or  mouse  serine  pro¬ 
tease.  This  would  suggest  that  the  product  of  the  TSP50  gene  is  a 
protease.  However,  at  this  point,  we  do  not  know  the  physiological 
function(s)  of  this  protease.  One  may  assume,  though,  that  it  could  be 
a  component  in  the  human  reproductive  pathway  due  to  it  being  solely 
expressed  in  the  testes. 

It  is  common  knowledge  that  the  expression  of  many  tissue-specific 
genes  is  regulated  by  DNA  methylations,  which  usually  modify  the 
promoter,  or  sometimes,  3'  regions  (3,  4,  22-24,  36).  Our  preliminary 
results,  although  only  obtained  from  analyzing  the  DNA  methylation 
status  of  the  3'  flanking  region  of  the  gene,  have  proven  that  TSP50 
is  one  of  those  tissue-specific  genes.  It  will  be  interesting  to  discover 
whether  the  promoter  region  of  the  gene  is  also  methylated  when  the 
corresponding  sequence  information  is  available.  The  Hpall  and  Mspl 
methylation-sensitive  Southern  analysis  of  the  3'  region  of  the  TSP50 
gene  demonstrated  that,  in  T/ptzII-digested  DNAs,  probe  BR50  hybrid¬ 
ized  two  bands  in  the  testes  tissue  but  none  in  the  other  samples.  The 
lower  band,  which  was  the  same  size  as  the  probe,  represented  the 
unmethylated  DNA  pattern,  whereas  the  upper  band  obviously  con- 
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DISCUSSION 

A  modified  RDA  technique  was  used  to  study  genetic  alterations  by 
using  breast  cancer  biopsies  as  a  working  model  system.  As  a  result, 
two  hypomethylated  genomic  DNA  fragments  were  successfully  iso- 


Fig.  7.  The  results  of  RT-PCR  for  TSP50  differentially  expressed  in  five  breast  cancer 
and  normal  control  tissues.  Lane  MW,  Hae III  0174  markers  in  bp.  The  fi-actin  and  TSP50 
lanes  served  as  positive  controls;  they  were  generated  by  using  cDNA  prepared  from 
testes  tissue  RNA.  Lane  fi-actin+TSP50  contains  simultaneously  generated  TSP50  and 
fi-actin  from  testes  cDNA.  The  number  for  each  patient  tested  is  listed  above  the  bracket. 
T  and  N,  tumor  and  normal  tissues,  respectively. 
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tained  the  internal  Hpall  recognition  site(s),  which  remained  methy¬ 
lated.  In  Mspl-digested  DNAs,  the  upper  band  was  dominant  in  most 
tissues,  whereas  in  the  testes,  the  lower  band  was  dominant.  These 
results  suggest  that  the  GGCCGG  end  of  BR50  was  methylated  in 
other  tissues  but  not  in  the  testes.  The  DNA  methylation  patterns 
observed  in  both  blots  are  probably  allelic  orientated.  It  seems  that 
DNA  hypomethylation  was  accompanied  by  the  expression  of  the 
gene  in  the  testes,  and  conversely,  DNA  hypermethylation  was  ac¬ 
companied  by  the  silencing  of  the  gene  in  other  tissues.  The  correla¬ 
tion  between  DNA  methylation  and  gene  expression  provided  addi¬ 
tional  proof  that  DNA  methylation  could  be  an  important  mechanism 
in  governing  the  expression  of  the  genes  in  various  differentiated 
human  cells  (12,  37,  38).  In  addition,  the  differential  expression  of  the 
TSP50  gene  has  been  tested  in  18  paired  breast  cancer  biopsies.  Our 
findings  have  shown  that  this  gene  was  activated  in  five  cancer 
samples.  In  the  near  future,  more  samples  from  different  types  of 
cancer  will  be  examined,  and  the  possibility  that  the  TSP50  gene 
product  might  be  one  of  the  factors  that  stimulate  human  cancer  will 
be  further  explored. 

Recently,  by  using  the  same  technique,  DNA  fragments  that  rep¬ 
resent  DNA  amplifications,  deletions,  and  rearrangements  were  also 
obtained  (data  not  shown).  Hopefully,  this  technique  will  lead  to  the 
discovery  of  additional  novel  genes  that  may  be  related  to  cancer 
development.  On  the  basis  of  our  experience,  the  process  of  isolating 
the  TSP50  gene  was  made  considerably  easier  by  the  modified  tech¬ 
nology,  where  the  MspI  enzyme  was  used  as  the  master  enzyme.  The 
ability  of  Mspl  to  recognize  GC-rich  sequences  and  its  sensitivity  to 
DNA  methylation  (17,  18)  apparently  accelerated  our  gene  search. 
Furthermore,  the  double  enzyme  cleavage  strategy  provides  another 
unique  and  efficient  feature  for  this  technique  because  ~40%  of  a 
human  genome  can  theoretically  be  analyzed  by  a  single  master 
enzyme  when  it  is  combined  with  a  different  partner  enzyme. 
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